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ABSTRACT 


The coronavirus disease 19 (COVID-19] is a highly transmittable and 
pathogenic viral infection caused by severe acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2], which emerged in Wuhan, China and spread 
around the world. Genomic analysis revealed that SARS-CoV-2 is 
phylogenetically related to severe acute respiratory syndrome-like [SARS-like] 
bat viruses, therefore bats could be the possible primary reservoir. The 
intermediate source of origin and transfer to humans is not known, however, 
the rapid human to human transfer has been confirmed widely. 

In this document we will analyze the structure and diversity of the pathogen 
and we will also discuss the previous emergence of human coronaviruses like 
Severe Acute Respiratory Syndrome Coronavirus [SARS-CoV] and Middle East 
Respiratory Syndrome (MERS-CoV). 

GRAPHICAL ABSTRACT 


iMIm 







rhii r ipitrll r iHf ilailBi 


/iA 


KEYWORDS: Coronavirus, SARS-CoV-2/C0VID-19, Origin, Outbreak, Spread, 
Natural Analysis: Protein homology modelling Molecular Docking and Molecular 
dynamics simulation. 


INTRODUCTION: 

The coronavirus disease (COVlD-19] was identified as the 
cause of an outbreak of respiratory illness in Wuhan, Hubei 
province, China in the end December 2019 [1] . After an 
outbreak of coronavirus was experienced in China, the virus 
killed more than eighteen hundred and infected over seventy 
thousand individuals within first fifty days of the epidemic. 
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respiratory tract infections that can range from mild to 
lethal. Mild illnesses include some cases of the common cold 
(which is caused also by certain other viruses, 
predominantly rhinoviruses], while more lethal varieties can 
cause SARS, MERS, which can also cause present disease 
COVlD-19. 


The World Health Organization [WHO] initially named this 
pathogen as [2019-nCoV] [2]. On January 10, the first whole- 
genome sequencing of (2019-nCoV] was released which 
helped the researchers to identify the type of virus and its 
diversity. 

This virus was reported to be a member of the (3 group of 
coronaviruses and The International Committee on 
Taxonomy of Viruses [ICTV] named the virus as SARS-CoV-2 
and the disease as COVlD-19 [3],[4],[5]. 

Taxonomy of Coronaviruses 

Coronaviruses is virus genera within the virus family 
Coronaviridae, under order Nidovirales. 

Coronaviruses are a group of related RNA viruses that cause 
diseases in mammals and birds humans, these viruses cause 


Structure of Coronaviruses 

Corona represents crown-like spikes on the outer surface of 
the virus; thus, it was named as a coronavirus. Coronaviruses 
are minute in size [65-125 nm in diameter] and contain a 
single-stranded RNA as a nucleic material, size ranging from 
26-32 kbp [Fig.l] in length and is among one of the largest 
among viruses 
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Figure 1. Structure of respiratory syndrome causing 
human coronavirus. 

They are enveloped viruses with a positive-sense single- 
stranded RNA genome and a Nucleocapsid of helical 
symmetry. This is wrapped in icosahedral protein shell, They 
have characteristic club-shaped spikes that project from 
their surface. 

Etymology 

The name "coronavirus" is derived from Latin corona, 
meaning "crown" or "wreath". The name was coined by June 
Almeida and David Tyrrell who first observed and studied 
human coronaviruses The name refers to the 

characteristic appearance of Virions (the infective form of 
the virus] by electron microscopy, which have a fringe of 
large, bulbous surface projections, this morphology is 
created by the viral spike Peplomers, which are proteins on 
the surface of the virus. 



Figure2. Images of virions under Electron microscopy. 
Classification of Coronaviruses 

The classification of Coronaviruses has been based on 
genomic organizations, similarities in genomic sequence, 
antigenic properties viral proteins, replication strategies and 
structural characteristics virions, pathogenic, 
cytopathogenic, and physicochemical properties. 

Coronavihdae family is the largest one of the four families, 
Coronaviridae virus family subdivided into two subfamilies, 
coronavirinae and torovirinae 

The subgroups of coronaviruses family are alpha (a], beta 
(P], gamma (y] and delta (6] coronavirus. 

Alpha coronaviruses (Alpha-CoV]: are the first of the four 
genera of Coronaviruses, 

Species: Human coronavirus 229E, Human coronavirus 
NL63, bat coronavirus. 


*The novel strain 229E were subsequently imaged by 
electron microscopy in 1967 by Scottish virologist June 
Almeida at St. Thomas Hospital in London. 

Beta coronaviruses (P-CoVs or Beta-CoVs]: 

Species: SARS-CoV, MERS-CoV, SARS-CoV-2/COVID-19. 

Gamma coronavirus: 

Species: Avian coronavirus (IBV], Beluga whale coronavirus 
SWl 

Delta coronavirus: 

Species: Bulbul coronavirus HKUll 

History of previously occurred Human Coronaviruses: 

Coronavirus disease was first described in 1931, with the 
first coronavirus (HCoV-229E] isolated from humans in 
1965. the outbreak of severe acute respiratory syndrome in 
late 2002, only two Human Coronaviruses [HCoV] were 
known - HCoV-229E and HCoV-0C43. 

The SARS coronavirus (SARS-CoV] has been identified after 
other human coronaviruses were identified, therefore there 
are three human coronavirus groups that still exist: group 1 
(HCoV-229E and HCoV-NL63], group 2 (HCoV0C43 and 
HCoV-HKUl], and SARS-CoV is an outline between group 2 
and group 3. 

1. SARS-CoV : In 2 003, the Chinese population was infected 
with a virus causing Severe Acute Respiratory Syndrome 
(SARS] in Guangdong province. The virus was confirmed 
as a member of the Beta-coronavirus subgroup and was 
named SARS-CoV [7]. The infected patients exhibited 
pneumonia symptoms [31] with a diffused alveolar 
injury which lead to acute respiratory distress 
syndrome (ARDS]. 

Viral genome analyses revealed that SARS-CoV belongs 
to Group B. 

SARS initially emerged in Guangdong, China and then spread 
rapidly around the globe with more than 8000 infected 
persons and 776 deceases. 

A decade later in 2012, a couple of Saudi Arabian nationals 
were diagnosed to be infected with another coronavirus. The 
detected virus was confirmed as a member of Coronaviruses 
and was named as Middle East Respiratory Syndrome 
(MERS-CoV]. 

The SARS-related coronavirus (SARS-CoV] was one of 
several viruses identified by the World Health Organization 
(WHO] in 2016 as a likely cause of a future epidemic. 

2. MERS-CoV: In 2012, Saudi Arabian nationals were 
diagnosed to be infected with it. 

MERS-CoV is a member beta-coronavirus subgroup and 
phylogenetically diverse from other human-CoV. 

The infection of MERS-CoV initiates from a mild upper 
respiratory injury while progression leads to severe 
respiratory disease [8] . Similar to SARS-coronavirus, 
patients infected with MERS-coronavirus suffer pneumonia, 
followed by (ARDS] and renal failure. MERS-CoV belongs to 
Group C. 
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Figures. Graphical representation of MERS-CoV and its spread. 


Primary Hosts of Coronaviruses: 

The source of origination and transmission is important in order to understand the strategies to be applied in prevention from 
infection. 


1. SARS-CoV: Initially researchers focused on raccoon dogs and palm civets as a key reservoir of transmission, the samples 
were isolated from the civets at the food market and it showed positive results for viral RNA detection, suggesting that 
civets are the Secondary hosts. 

In 2001, the samples were isolated from the healthy persons of Hong Kong and the molecular assessment showed 2.5% 
frequency rate of anti-bodies against SARS-coronavirus. These indications suggested that SARS-coronavirus may be 
circulating in humans before the outbreak. 

Later on conducting study, Rhinolophus bats were also found to have anti-SARS-CoV antibodies suggesting the bats as a 
source of viral replication. 

And it was identified as. Bats are a major reservoir of many strains of SARS-related coronaviruses, and several strains have 
been identified in palm civets, which were likely ancestors of SARS-CoV. 

2. MERS-CoV: it first emerged in 2012 in Saudi Arabia. MERS-coronavirus also pertains to have camels as a zoonotic source 
or primary host. 

Later on detail study it was found that, the detected virus was confirmed as a member of coronaviruses and named as the 
Middle East Respiratory Syndrome Coronavirus (MERS-CoVJ. The World health organization reported that MERS- 
coronavirus infected more than 2428 individuals and 838 deaths [8]. 

MERS-CoV is a member beta-coronavirus subgroup and phylogenetically diverse from other human-CoV 


MERS-coronavirus was also detected in Pipistrellus and Perimyotis bats, proffering that bats are the key host and 
transmitting medium of the virus. 
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Figure4. The key reservoirs and mode of transmission of coronaviruses. 


(suspected reservoirs of SARS-CoV-2 are red encircled]; only a and (3 coronaviruses have the ability to infect humans, the 
consumption of infected animal as a source of food is the major cause of animal to human transmission of the virus and due to 
close contact with an infected person, the virus is further transmitted to healthy persons. Dotted black arrow shows the 
possibility of viral transfer from bat whereas the solid black arrow represents the confirmed transfer. 
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ABOUT 

Coronavirus Disease 2019 (COVlD-19) 

1. BASIC STRUCTURE: 

It is a positive-stranded RNA viruses with a crown-like appearance under an electron microscope, the prefix is derived from a 
Latin term [coronam - crown] due to the presence of spike glycoproteins on the envelope. 

It is also reported that Spike glycoprotein of the Wuhan coronavirus is modified via homologous recombination. 

The spike glycoprotein of SARS-CoV-2 is the mixture of bat SARS-CoV and a not known Beta-CoV. 

The spike protein of SARS-CoV-2 contains a 3-D structure in the RBD region to maintain the van der Waals forces. [32] 

2. GENOME 

Recent studies show that SARS-CoV-2 has a similar genomic organization to other coronaviruses, the genome has the usual 5' 
methylated cap and a 3' Poly Adenylated tail, consisting of a 5'-3' untranslated region (UTR], Open reading frame (orfl) 
encoding non-structural proteins, a spike protein (S] gene, envelope protein (E) gene, a membrane protein (M) gene, a 
Nucleicapsid protein (N) gene. 

There are 265 nucleotides in the 5'UTR and 342 nucleotides in 3'UTR. 


The 5' methylated cap and 3' Poly Adenylated tail allows the positive-sense RNA genome to be directly translated by the host 
cell's ribosome on viral entry. 


Studies also highlighted that SARS-CoV-2 genes share <80% nucleotide identity and 89.10% nucleotide similarity with SARS- 
CoV genes. 

Recent studies have indicated notable variations in SARS-CoV and SARS-CoV-2 such as the absence of 8a protein and fluctuation 
in the number of amino acids in 8b and 3c protein in SARS-CoV-2. 
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Figures. Schematic representation of the genome organization and functional domains of S protein for SARS-CoV and MERS- 
CoV. The single-stranded RNA genomes of SARS-CoV and MERS-CoV encode two large genes, the ORFla and ORFlb genes, 
which encode 16 non-structural proteins (nspl-nspl6] that are highly conserved throughout coronaviruses. The structural 
genes encode the structural proteins, spike (S], envelope [E], membrane (M], and nucleocapsid (N], which are common features 
to all coronaviruses. The accessory genes [shades of green] are unique to different coronaviruses in terms of number, genomic 
organization, sequence, and function. The structure of each S protein is shown beneath the genome organization. The S protein 
mainly contains the SI and S2 subunits. The residue numbers in each region represent their positions in the S protein of SARS 
and MERS, respectively. The S1/S2 cleavage sites are highlighted by dotted lines. SARS-CoV, severe acute respiratory syndrome 
coronavirus; MERS-CoV, Middle East respiratory syndrome coronavirus; CP, cytoplasm domain; FP, fusion peptide; HR, heptad 
repeat; RBD, receptor-binding domain; RBM, receptor-binding motif; SP, signal peptide; TM, transmembrane domain. 
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This polypeptide is proteolytic-ally cleaved to generate various proteins. The proteolytic processing is mediated by papain¬ 
like protease (PL pro) and 3-chymotrypsin-like protease (3CLpro). 

The 3CLpro cleaves the polyprotein at 11 distinct sites to generate various non-structural proteins that are important for viral 
replication, it also plays a critical role in the replication of virus particles and unlike structural/accessory protein-encoding 
genes, [8] It is located at the 3' end which exhibits excessive variability [30]. 


Structure-based activity analyses and high-throughput studies have identified potential inhibitors for SARS-CoV and MERS-CoV 
3CLpro [10]. 
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Figure6. Genomic structure of SARS-CoV and MERS-CoV. 


The typical coronavirus [CoV] genome is a single-stranded, non-segmented RNA genome, which is approximately 26-32 kb. It 
contains 5'-methylated caps and 3'-polyadenylated tails and is arranged in the order of 5', replicase genes, genes encoding 
structural proteins (spike glycoprotein (SJ, envelope protein (EJ, membrane protein (M] and nucleocapsid protein [NJ], 
polyadenylated tail and then the 3' end. The partially overlapping 5'-terminal open reading frame la/b (ORFla/b] is within the 
5' two-thirds of the CoV genome and encodes the large replicase polyprotein la (pplaj and pplab. These polyproteins are 
cleaved by papain-like cysteine protease [PLpro] and 3C-like serine protease (3CLpro] to produce non-structural proteins, 
including RNA-dependent RNA polymerase [RdRp] and helicase (HelJ, which are important enzymes involved in the 
transcription and replication of CoVs. The 3' one-third of the CoV genome encodes the structural proteins (S, E, M and N], which 
are essential for virus-cell-receptor binding and virion assembly, and other non-structural proteins and accessory proteins that 
may have immunomodulatory effects. 
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Figure?, graphical abstract of SARS-CoV2 Cell entry: 


SARS-CoV-2 uses the SARS-CoV receptor ACE2 for host cell entry 
The spike protein of SARS-CoV-2 is primed by TMPRSS2 

Antibodies against SARS-CoV spike may offer some protection against SARS-CoV-2 

The single N501T mutation in SARS-CoV-2's Spike protein may have significantly enhanced its binding affinity for ACE2. 

The 394 glutamine residue in the MPR region of SARS-CoV-2 is recognized by the critical lysine 31 residue on the human ACE2 
receptor [33]. 

In a fluorescent study, SARS-CoV-2 uses the SARS-CoV receptor ACE2 for entry and the serine protease TMPRSS2 for S protein 
priming. 
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3. SEQUENCE ANALYSIS 

In order to identify similar sequences and key/conserved residues, and to infer phylogeny, multiple sequence alignment of 
SARS-CoV-2 3 CLp^° followed by phylogenetic tree analyses were performed using T-Coffee [15] and the alignment Figurewas 
generated using ESPriptS [16]. Physicochemical parameters of SARS-CoV-2 3 CLp^° including isoelectric point, instability index, 
grand average of hydropathicity [GRAVY], and amino acid and atomic composition were investigated using the ProtParam tool 
of ExPASy [14]. 

4. STRUCTURAL ANALYSIS 

To probe the molecular architecture of SARS-CoV-2 3 CLp^°, comparative homology modelling was performed using Modeller 
v9.11 [17]. To select closely-related templates for modelling, PSI-BLAST was performed against all known structures in the 
protein databank [PDB] [18]. Chimera vl.8.1 [19] and PyMOL educational version [20] were used for initial quality estimation, 
energy minimisation, mutation analyses, and image processing. 

5. LIGAND AND DATABASE PREPARATION: 

A comprehensive medicinal plant library containing 32,297 potential anti-viral phytochemicals and traditional Chinese 
medicinal compounds was generated from our previously collected data and studies [13,[21], [22], [23]], and screened against 
the predicted SARS-CoV-2 3 CLp^° structure. Molecular operating environment [MOE] [24] was used for molecular docking, 
ligand-protein interaction and drug likeness analyses. All analyses were performed using the same protocols that are already 
described in our previous studies [13,25,26]. The qualitative assessment of absorption, deposition, metabolism, excretion and 
toxicity [ADMET] profile of selected hits were predicted computationally by using ADMET sar server [27]. 

6. MOLECULAR DYNAMICS AND SIMULATIONS: 

Explicit solvent molecular dynamics [MD] simulations were performed to verify docking results and to analyze the binding and 
stability of potential compounds by using the predicted SARS-CoV-2 3CLpr° homology model. 

GROMACS v5.1.4, GR0M0S96 and the PRODRG server were employed to run 50 ns MD simulations [28,29] following the same 
protocol as described in our previous studies [13]. 

7. SEQUENCE AND STRUCTURAL ANALYSIS RESULTS: 

Multiple sequence alignment results revealed that 3 CLpro was conserved, with 100% identity among all SARS-CoV-2 genomes. 

The SARS-CoV-2 3CLpro protein sequence was compared with its closest homologs (Bat-CoV, SARS-CoV, MERS-CoV, Human- 
CoV and Bovine-CoV]. 

The results revealed that SARS-CoV-2 3CLpro clustered with bat SARS-like coronaviruses and sharing 99.02% sequence 
identity [FiglA]. 

Furthermore, it shares 96.08%, 87.00%, 90.00% and 90.00% sequence identity with SARS-CoV, MERS-CoV, Human-CoV and 
Bovine-CoV homologs, respectively 
[Fig. IB]. 

These finding were consistent with an initial study reporting that SARS-CoV-2 is more similar to SARS-CoV than MERS-CoV, and 
shares a common ancestor with bat coronaviruses. 



Figures. Phylogenetic tree inferred from closest homologs of SARS-CoV-2. 
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The maximum likelihood method was used to construct this tree. (B] Multiple sequence alignment of closest homologs of SARS- 
CoV-2 3CLpro sharing >70% sequence identity. (CJ Cartoon representation of the SARS-CoV-2 3CLpro homodimer. Chain-A 
(protomer-AJ is in multicolor and Chain-B (protomer-BJ is in dark blue. The N-finger that plays an important role in 
dimerization maintaining the active conformation is shown in hot pink, domain I is colored cyan, domain II is shown in green, 
and domain III is colored yellow. 

The N- and C-termini are labelled. Residues of the catalytic dyad [Cys-145 and His-41] are highlighted in yellow and labelled. 
[D] Cartoon representation of the 3CLpro monomer model [chain/protomer-A] ofSARS-CoV-2 superimposed with the SARS- 
CoV 3CLpro structure. The SARS-CoV 3CLpro template is colored cyan, the SARS-CoV-2 3CLpro structure is colored grey, and all 
identified mutations are highlighted in red. (EJ Docking of 5,7,3',4'-tetrahydroxy-2'-(3,3-dimethylallyl] flavone inside the 
receptor-binding site of SARS-CoV-2 3CLpro, showing hydrogen bonds with the catalytic dyad (Cys-145 and His-41]. The 
3CLpro structure is colored dark blue, the 5,7,3',4'-tetrahydroxy-2'-(3,3-dimethylallyl] flavone is orange, and hydrogen colored 
maroon. 

8. MOLECULAR DOCKING: 

To test this hypothesis, sequences were docked (R]-N-(4-(tert-phenyl]-N-[2-(tert-butylamino]-2-oxo-l-(pyridin-3-ethyl furan- 
2-carboxamide]], a potential noncovalent inhibitor of SARS-CoV 3CLpro named ML188 [35], with the SARS-CoV-2 3CLpro 
homology model. 

The inhibitor target is to bind Cys-His catalytic dyad (Cys-145 and His-41] with the other residues, and increase as a result the 
docking score (S = -12.27] relatively high. 



However, surprisingly, ML188 did not show significant binding to the catalytic dyad (Cys-145 and His-41] ofSARS-CoV-2,and 
the docking score (S = -8.31] was considerably lower. 


These results indicated that the 12-point mutation is identified which may disrupt important hydrogen bonds and alter the 
receptor binding site. 


9. PHYLOGENETIC TREE: 
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The phylogenetic tree shows the relationship of Wuhan-Hu-1 (denoted as red] to selected coronavirus is based on nucleotide 
sequences of the complete genome. The viruses are grouped into four genera. 

This tree is based on the published trees of Coronaviruses and is reconstructed with sequences of the complete RNA- 
dependent RNA polymerase- coding region of the representative novel coronaviruses. 

Severe Acute Respiratory Syndrome Coronavirus (SARS- CoV], The Middle East respiratory syndrome coronavirus (MERS- 
CoV]; Porcine Enteric Diarrhea Virus (PEDV]; Wuhan seafood market pneumonia (Wuhan-Hu-1]. 

Genetic analysis has revealed that the coronavirus genetically clusters with the genus Beta coronavirus, in subgenus 
Sarbecovirus (lineage B] together with two bat-derived strains. It is 96% identical at the whole genome level of other bat 
coronavirus samples 
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Recent studies have indicated notable variations in SARS-CoV and SARS-CoV-2 such as the absence of 8a protein and fluctuation 
in the number of amino acids in 8b and 3c protein in SARS-CoV-2 [34] (Fig. 4]. It is also reported that Spike glycoprotein of the 
Wuhan coronavirus is modified via homologous recombination. 

(SARS-CovJ. The results suggested that homologous recombination may have occur and contributed to the 2019-nCoV cross¬ 
species transmission. 
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Figure9. Betacoronaviruses genome organization; 


The Betacoronavirus for human (SARS-CoV-2, SARS-CoV and MERS-CoV] genome comprises of the 5'-untranslated region (5'- 
UTR], open reading frame (orf) la andla/b [green box) encoding non-structural proteins (nsp) for replication, structural proteins 
including spike [blue box), envelop [maroon box), membrane [pink box), and nucleocapsid [cyan box) proteins, accessory proteins 
(light gray boxes] such as orf 3,4a, 4b 6, 7a, 7b, 8b and 9b in the SARS-CoV-2 genome, and the 3'-untranslated region (3'-UTR]. 

The doted underlined in red are the protein which shows key variation between SARS-CoV-2 and SARS-CoV. 


SARS-CoV Showed high sequence identity to SARS-CoV-2. 
CONCLUSION: 

The novel coronavirus originated from the Hunan seafood 
market at Wuhan, China where bats, snakes, raccoon dogs, 
palm civets, and other animals are sold, has rapidly spread 
up to 109 countries. The zoonotic source of SARS-CoV-2 is 
not confirmed, however, sequence-based analysis 
suggested, bats as tbe key reservoir. 

DNA recombination was found to be involved at spike 
glycoprotein which assorted SARS-CoV with the Receptor 
Binding Domain (RBD] of another Beta Coronavirus, thus 
could be the reason for cross-species transmission and rapid 
infection. According to phylogenetic trees, SARS-CoV2 is 
closer to SARS-CoV. 

Upon deep analysis it was also found that, 3CLp^^ is conserved 
in SARS-CoV-2. 

It is highly similar to bat SARS-like coronavirus 3 CLp*’o, with 
some differences from other beta-coronaviruses. 

SARS-CoV-2 3 CLp^° is conserved, share 99.02% sequence 
identity with SARS-CoV 3 CLp^^ and together with 12 point- 
mutations. 

These 12 point Mutations also disrupt hydrogen bonds and 
alter the receptor binding site of SARS-CoV-2 3 CLp^° 

Taking advantage of all the genomic information currently 
available, researchers constructed a phylogenetic tree 
including representatives of other coronaviridae members, 
such as Bat coronavirus (BCoV] and severe acute respiratory 
syndrome (SARS]. It was confirmed that high sequence 
similarity (>99%] is observed between all sequenced 
2019-nCoVs genomes available, with the closest BCoV 


sequence sharing 96.2% sequence identity, confirming the 
notion of a zoonotic origin of 2019-nCoV genome. 

Finally, a full proteomic comparison with other 
coronaviridae members, by identifying key amino acids and 
its differences are considered for deriving it relationship 
from previously coronavirus approaches. 
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